Outlier Diagnostics in Logistic Regression: A Supervised Learning Technique

نویسندگان

  • A. A. M. Nurunnabi
  • Mohammed Nasser
چکیده

The goal of supervised learning is to build a concise model of the distribution of class labels in terms of predictor features. Logistic regression is one of the most popular supervised learning technique that is used in classification. Fields like computer vision, image analysis and engineering sciences frequently encounter data with outliers (noise). Presence of outliers in the training sample may be the cause of large training time, misclassification, and to design a faulty classifier. This article provides a new method for identifying outliers in logistic regression. The significance of the measure is shown by well-referred data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Anomaly With Fuzzy C-means ANN Using Semi-Supervised Approach

The FC-ANN (Artificial Neural Network) is used to speed up the technique. The anomaly Outlier detection is primary in various data-mining applications. Outlier detection methods have been suggested for number of application such as, fraud detection, voting irregularity analysis, data cleansing, clinical trials, network intrusion, severe weather prediction, geographic information system, credit ...

متن کامل

Sublinear Algorithms for Penalized Logistic Regression in Massive Datasets

Penalized logistic regression (PLR) is a widely used supervised learning model. In this paper, we consider its applications in largescale data problems and resort to a stochastic primal-dual approach for solving PLR. In particular, we employ a random sampling technique in the primal step and a multiplicative weights method in the dual step. This technique leads to an optimization method with su...

متن کامل

The Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution

This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...

متن کامل

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Outlier detection is useful for credit card fraud detection. Due to drastic increase in digital frauds, there is a lot of financial losses and therefore various techniques are developed for fraud detection and applied to diverse business fields. In high-dimensional data, outlier detection presents some challenges because of increment of dimensionality. In this paper, the proposed model aims to ...

متن کامل

Episodic Reinforcement Learning by Logistic Reward-Weighted Regression

It has been a long-standing goal in the adaptive control community to reduce the generically difficult, general reinforcement learning (RL) problem to simpler problems solvable by supervised learning. While this approach is today’s standard for value function-based methods, fewer approaches are known that apply similar reductions to policy search methods. Recently, it has been shown that immedi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011